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Abstract 

We study local energy minima of twenty (L-alanine). The minima are gener- 
ated using high-temperature Molecular Dynamics and Chain-Growth Monte Carlo 
simulations, with subsequent minimization. We find that the lower-energy config- 
urations are a-helices for a wide range of dielectric constant values (e = 1, 10,80), 
and that there is no noticeable difference between the distribution of energy minima 
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in (j)ip space for different values of e. Ultrametricity tests show that lower-energy (a 
-helical) e = 1 configurations form a set which is ultrametric to a certain degree, 
providing evidence for the presence of fine structure among those minima. We put 
forward a heuristic argument for this fine structure. We also find evidence for ultra- 
metricity of a different kind among e = 10 and e = 80 energy minima. We analyze 
the distribution of lengths of a-helical portions among the minimized configurations 
and find a persistence phenomenon for the e = 1 ones, in qualitative agreement with 
previous studies of critical lengths. 
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1 INTRODUCTION 

There is a number of reasons for studying the structural properties of energy minima of 
poly(L-alanines). We shall cite here just a few: 

1. It has been recently confirmed [|l[] that poly(L-alanine) may serve as a good model for 
many proteins; namely, that the native backbone geometry for a polypeptide or protein 
of M residues very often has a closely related metastable analog in a poly(L-alanine) 
of the same number M of residues. The proteins for which this hypothesis has been 
successfully checked include bovine pancreatic trypsin inhibitor (M=58), crambin (M=46), 
ribonuclease A (M=124), and superoxide dismutase (M=158). 

2. Our studies of hepta(L-alanine) @ showed that all energy minima group themselves 
into dense clusters in configuration space, and the extension of this behavior to longer 
molecules is quite plausible and supported, indeed, by the results of the present paper. On 
the other hand, there is a general belief that the global minimum of poly(L-alanine) in the 
vacuum is an a-helix for sufficiently long molecules. Therefore, we have reasons to expect 
that there are completely or partially a-helical clusters in the low-energy configurations 
of any poly(L-alanine), and it would be interesting to find more about the cluster fine 
structure, if any. 

3. If a-helices are in fact the lowest-energy conformations, why is that so? Is it due 
mostly to steric or electrostatic effects? What is the influence of the dielectric constant? 

In this paper we study these and related questions, taking as an example twenty(L- 
alanine). This molecule is sufficiently long that a-helices become the lowest-energy con- 
figurations (contrary to the hepta-alanine case 0), but short enough for an extensive 
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computational study. The energy-minimized configurations are obtained by computer 
simulation and minimized at three different values of dielectric constant: e — 1, 10, 80. 
We find that, indeed, there is evidence for fine structure among the e = 1 a-helical 
(lowest) minima: the set of such minima seems ultrametric. We also find evidence for 
ultrametricity (of a different kind) in the e = 10, 80 cases. We study the a-helical content 
of minimized configurations, and find that for e = 1, there is "persistence" of an a-helical 
structure once it has reached a certain length; this is consistent with earlier studies of 
critical length for a-helix formation and distribution of a-helical portions of longer chains 
HI HI HI @- We show that a-helices are the lowest-energy structures independently of 
e, but that the reason for this property - the relative role of different terms in energy 
- does in fact depend on e. We also note that the value of the dielectric constant does 
not much affect the position of energy minima in the (pip space: the (pip-space clusters of 
energy minima stay very much in place. 

The paper is organized as follows. In Section 0, we briefly describe the model and the 
sampling methods. We have generated Boltzmann-distributed configurations of twenty (L- 
alanine) at various temperatures. These configurations were obtained by two methods: 
(1) the Langevin Molecular Dynamics simulations run with CHARMM 0, || and (2) the 
Monte Carlo Chain-Growth method described in ||. Subsequent minimization was done 
with the Adopted Basis-set Newton-Raphson algorithm (ABNR) under CHARMM. 

Section |3| is dedicated to the results. In particular, ultrametricity (the evidence for 



fine structure among lower-energy minima) is discussed in Section pT3| . The statistics 



concerning a-helical content among the minima, is presented in Section 3.4, and the role 



4 



of various energy terms for stabilizing the helical conformations is discussed in Section |0 
We end with a short conclusion. 
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2 THE MODEL AND THE METHODS 

In this section we describe the model and the sampling methodology. 

Exactly as we did for hepta(L-alanine) ||, the 20(L-alanine) molecule is represented 
as C0 2 -CH(CH3)-NH-(CO-CH(CH3)-NH) 18 -CO-CH(CH 3 )-NH3. The atom numbering 
scheme and chemical structure are shown in Fig. 1. 

The charges on the carboxyl and ammonium groups are set to zero so as to avoid 
dominating charge-charge interactions. In this way the principal electrostatic interactions 
will result in hydrogen-bonding between backbone peptide groups. 

2.1 The energy function 

For both methods mentioned above, we use the same energy function as described previ- 
ously §, g: 

E= J2 h(b-b ) 2 + ]T k e (9-9 ) 2 + Y, k^l + co 8 [mf>-S\) 

bonds angles dihedrals 

+ £ ^--o) 2 + E^4^) 12 -fe)VE^[el4(.j)] (1) 

impropers i<j \ V / \ 'J / i,j l 

where distances are in A, angles in radians, and E is in kcal/mol. 

In Eq. (1) b, 9, <fi and v are the bond lengths, angles, dihedrals and improper torsions 
and b , 9 , z/ are the reference values for these properties. The force constants associated 
with these terms are k , kg, k^ and k v . For the intrinsic torsions n is the symmetry number 
of the rotor (e.g., 3 for a methyl group) and S is the phase angle. 

The nonbonded terms in Eq. (1) are pairwise additive and consist of a 12-6 Lennard- 
Jones van der Waals term and a Coulombic term representing electrostatic interactions 
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between atomic point partial charges. These nonbonded interactions are included for 
1,4 (vicinal) or higher order atom pairs. The quantities r^, e^, cr^-, g.; and qj are the 
nonbonded distance, the Lennard- Jones well depth, the Lennard- Jones diameter and the 
charges for atom pairs i and j. The factor [el4(z, j)} in Eq. (1) is 0.5 for (1,4) pairs and 
1 for all other pairs of atoms. A distance- independent relative dielectric constant e of 1, 
10, or 80 is assumed. The parameters used in the energy function can be found, e.g., in 

§• 

2.2 Sampling Methods 

We made use of two sampling methods here: (1) traditional Molecular Dynamics, and 
(2) Monte Carlo Chain-Growth method ||. Notice that our aim was obtaining as many 
as possible different configurations belonging to different basins of attraction of the mini- 
mizing procedure, rather than obtaining good-quality Boltzmann ensembles. This is why 
it was quite appropriate to use finite-temperature configurations produced with one value 
of e as starting points for an energy minimization at this and other values of e. Heuristi- 
cally, this idea was due to the following consideration: in the approximation in which the 
molecule's energy is dominated by electrostatics, changing e is equivalent to changing the 
temperature. 

Changing the value of e for the purposes of minimization may seem a rather unusual 
way of handling a sampling procedure. One may worry that this could result in an 
undesired biasing of the ensemble of minima. Such a consideration would be appropriate 
if, in doing so, we could lose information. In the case at hands, however, we could certainly 
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only gain in finding more minima, wherever they come from. Namely, without using this 
procedure, one obtains the deepest energy minima at all values of e in the form of a- 
helices. However, the statistics is too poor, and one would like to increase it. For this 
purpose, we observe that all (e = regenerated configurations converging to a helix upon 
minimization, belong to the basins of attraction of e = 10 and e = 80 a-helical minima as 
well. In fact, this procedure gave us as many e = 10 and e = 80 a-helices as the standard 
procedure did: half of those, in our simulation, were produced by a minimization starting 
from e = 1 strongly-biased Monte-Carlo generated configurations. 

Thus, in this way we obtained a better representation of the minima population for 
each value of e without having to perform more high-cost finite-temperature MC or MD 
simulations. 

(1) MC Chain-Growth Method 

A detailed analysis of the Monte Carlo Chain-growth method can be found in 0. In 
this procedure, an ensemble of chains (molecules) is grown atom by atom, so as to generate 
an ensemble which (in the large-ensemble limit) obeys the Boltzmann distribution. One 
advantage of this method is that it does not assume any particular guess for the initial state 
but rather builds up the chain according to Boltzmann weights. Several such Monte Carlo 
runs at various temperatures (300K, 1000K, and 6000K) and values of e were performed, 
giving a population of 86 different configurations produced at e = 1, 36 configurations at 
e = 10, and 16 configurations at e = 80. 

In one of the e = 1 MC simulations, yielding 16 different configurations, we used a 
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biased (guiding field) version of the chain-growth method |L0|. Minimization of these at 
all three values of e invariably produced completely a-helical configurations. 

A typical CPU time necessary to produce one configuration was 0.7 s on a Cray 2. 



(2) Molecular Dynamics 

MD simulations were performed using the program CHARMM [|7| with the Langevin 



Dynamics algorithm [II]]. In this approach, the equation of motion contains a frictional 
term and a Gaussian external force. In a one-dimensional representation the equation is 
as follows, 

mx = -U'{x) -72; + /(*), (12) 

where the friction coefficient 7 and the force correlation function < f(t)f(0) >= C5(t) are 
related by C = 2 r yksT. In our simulations, the values of 7/m = 40sec _1 and T = lOOOfC 
were used. We performed two such simulations, with different starting points (a-helix 
and /3-sheet), for each of the three values of e. For e = 1, one simulation of 400 ps and 
one of 297 ps were performed. For e = 10, the two runs were 100 ps and 150 ps long. For 
e = 80, they were 150 ps and 200 ps long. The configurations to be minimized were taken 
each 1 ps of the MD trajectory. 

A typical 100-ps MD simulation took the average of 37 hrs CPU time on an Alliant 
VFX-80 machine. 



(3) Minimization 

As we have already mentioned, we used configurations produced at each value of e for 
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minimization both at that and other e values. 

In this way, we obtained 799 e = 1 configurations, 272 e = 10 configurations, and 452 
e = 80 configurations to be minimized. The minimization was done in two stages. To 
save CPU time, and because we are more interested in low-lying minima, a preliminary 
minimization of 2000 cycles of Adopted Basis-set Newton-Raphson (ABNR) procedure 
was performed on all of the above configurations. Then these partly-converged minima 
were ordered in energy, and a set of 2-dimensional 4>ip plots was obtained. Then a num- 
ber of lower-energy configurations (see Table I) were chosen to be minimized until the 
rms energy derivative was 10~ 6 kcal/mol A or smaller. (The partly minimized configura- 
tions were so close in configuration space to actual minima that there was no noticeable 
difference between the (pip plots of partially and completely minimized configurations). 
This second stage of minimization took from 3000 to 15000 cycles of the ABNR. Natu- 
rally, some configurations converged to identical minima. The total count of initial and 
minimized configurations is given in Table I. 

Typically, minimization took about 3 hrs per 1000 ABNR steps CPU time on an 
Alliant VFX-80 machine, i.e. 6 hrs for the first, 2000-step stage, with 9 to 45 more hours 
during the minal minimization stage. 
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3 RESULTS 



3.1 ^-distribution and classification of minimum-energy struc- 
tures 

The obtained (pip-maps are very similar among themselves. There is not much difference 
between the maps of different (pip planes, nor between the maps corresponding to different 
values of e; we present two such maps, for e = 1 and e = 80 (Fig. 2a, b). All of the 
maps show clusters of points located in the same regions as those previously obtained for 
hepta(L-alanine) 0. Moreover, we know that their localisation corresponds to the energy 
minima of an alanine dipeptide 0. This means that the long-distance cross-residue effects 
do not affect the location of the minima in the first approximation. 

This fact allows us to use the techniques introduced earlier in 0, and characterize 
a configuration by a set of 18 digits varying from 1 to 9. Namely, first we consider the 
configuration as belonging to the 36-dimensional space spanned by the 18 inner pairs of 
<p and ip backbone angles. This makes our study much easier than if we considered the 
whole (3N-6)= 603-dimensional coordinate space. Then we divide each (pip plane into 
rectangles as shown in Fig. 3. The division is such that individual clusters of points on 
the two-dimensional maps (Fig. 2) do not share rectangles. This classification divides 
the 36-dimensional space into 9 18 rectangular "boxes", each of which can be labeled by a 
sequence of 18 integers referring to the consecutive (pip planes, e.g. {56157611329 
7244968} etc., so that we can label all minima as belonging to this or that box. This 
classification provides an easily observable characteristic of a structure, so that, e.g., one 
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can see at a glance the change that a given configuration undergoes during minimization 
(Table II). In our opinion, such a representation is often preferable to two-dimensional 
molecular tracings. While, looking at the latter, it is quite difficult to observe that a 
given structure possesses, say, two a-helical pieces joined by a non-a-helical one, one can 
immediately see this fact if one encounters a structure labeled {1222217222222 
2 2 2 2 2}. Moreover, we shall often simplify the notation by writing the latter structure 
as {1 2^ 4 ) 1 7 2 1 - 11 )}, like we did in ref. | JLU[| , so that a complete a-helix becomes {2^ 18 ^}. 

3.2 Characteristics of the low-energy minimized configurations 

For each value of the dielectric constant, we have considered the obtained energy-minimized 
configurations. As we have just said, the <j)ip distribution of minima is not much affected by 
the value of e. Contrary to the case of hepta(L-alanine), the lowest-energy configurations 
always are complete a-helices. This is illustrated in Tables III, IV, V for e = 1,10,80, 
respectively. We included in Table III an artificially obtained left-handed helical structure 
(3^ 18 ^). One can see that its energy is much higher (by about 40 kcal/mol) than those of 
a-helices. 

The chain tracing of the lowest-energy e = 1 structure is given in Fig. 4. The hydrogen 
bonds are given as broken lines. One should remember that there is no explicit H-bond 
term in the CHARMM energy function: H-bonds are simulated by a combination of the 
electrostatic and Lennard- Jones terms. 

Of the 19 e = 1 complete a-helices obtained after minimizing the 799 initial configu- 
rations, 5 are the result of MC simulations, of which 3 were the biased e = 1 MC runs, 
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and the other 2 were nonbiased MC e = 80 runs minimized at e = 1. The remaining 14 
configurations are the result of e = 1 MD simulations starting from an a-helix. 

All the 12 e = 10 complete a-helices obtained after minimizing the 272 initial con- 
figurations are the result of MC simulations, of which 6 were the biased e = 1 MC runs 
minimized at e = 10, and the other 6 were nonbiased MC e = 10 runs. 

All the 10 e = 80 complete a-helices obtained after minimizing the 452 initial con- 
figurations are the result of MC simulations, of which 5 were the biased e = 1 MC runs 
minimized at e = 80, and the other 5 were nonbiased MC e = 80 runs. 

The energy differences between the highest and lowest-energy a-helices (2( 18 - ) configu- 
rations) are: 11.4 kcal/mol (e = 1), 4.1 kcal/mol (e = 10), 2.0 kcal/mol (e = 80). 



3.3 Ultrametricity 

We find dense clusters of minima in configuration space. This phenomenon has already 
been studied for small folded globular proteins using MD |L2| and MC [l3j techniques. A 



thorough analysis of the clusterization phenomenon was conducted in our previous paper 
0. It has already been noted more than once (e.g. |1~2]| ) that the complexity of the 
configuration space and the existence of multiple minima with small energy differences 
bears resemblence to the case of spin glasses. It comes to mind, therefore, that it would 
be useful to verify if our distribution of minimized conformations shares with spin glasses 
one of their interesting properties, i.e. ultrametricity [13]. We remind that a simplistic 



notion of ultrametricity can be made by imagining a series of clusters within clusters: they 
form a tree-like structure, so that clusters are present at an arbitrary scale. A preliminary 
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analysis, for the case of folded myoglobin, can be found in |12| , where no direct evidence 
for a tree structure was found. 

In our previous paper on hepta(L-alanine) 0, we looked into the question whether 
the minimized configuration set forms an ultrametric ensemble. No evidence for such 
a property was found there. We are posing the same question for the present case of 
twenty (L-alanine). In the e = 1 case, the answer remains negative, if one considers the 
whole set of energy-minimized configurations; and it is positive, if one restricts the set 
to the lower, completely or partially a-helical configurations. In the e = 10 and e = 80 
cases, the situation is somewhat different. 

To test if the 20-alanine distribution is ultrametric, we use the distance between the 
minima. There is no unique choice of such a distance. A natural choice for peptides is 
the distance in 36-dimensional <j)ip space, defined as follows : 

18 

i 

corrected to take into account the periodicity of and ip coordinates: if \4>i — 4>'i\ > 180°, 
it is substituted by (360° — \<j>i — <^|), and the same applies to if). 

Considering all possible triplets of energy minima, and forming triangles with vertices 
at each minimum, one then plots the length of the second longest side of each triangle 
against the length of the longest side. In the ideal ultrametric case, the points will lie on 
the diagonal y = x. If no ultrametricity is present, the points will fill the triangle between 
the diagonal and the line y = x/2. (The lower bound is a trivial result of the triangle 
inequality: if a, b, c are sides of a triangle, a < b + c). If ultrametricity is present to some 
extent, the density of points will become greater towards the diagonal. 
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In Figs. 5, 6, and 7 we present such plots for e = 1, e = 10, and e = 80 cases. The 
computations were done for samplings of 133 out of 799, 69 out of 272, and 61 out of 452 
configurations, respectively (see Table I). Namely, in the e = 1 and e = 10 cases, we took 
all fully minimized configurations for this analysis. In the e = 80 case, we felt that 18 
fully minimized configurations would give us too poor a statistics, so that we added 43 
upper, partially minimized configurations. 

In the e = 1 case, due to the printer's memory limitations, we could not present 
all the Cf 33 =383306 triplets on the graph, so that Fig. 5 shows a subset of C| 7 =47905 
triplets generated by taking each second configuration of the 133 configurations used 
in computation. The center of inertia of the actual distribution of 383306 points is at 
(x = 241.0378, y = 211.8622), i.e. on the line y = 0.88x. The ratio of the moments of 
inertia of the point distribution along the principal axes is 18.0834. 

For e = 10, the plot consists of Cf 9 =52394 triplets. Its center of inertia is at (x = 
450.6481, y = 423.6821), i.e. on the line y = 0.94a;. The ratio of the moments of inertia 
of the point distribution along the principal axes is 72.107. 

For e = 80, the plot consists of (7^=35990 triplets. Its center of inertia is at (x = 
599.8134, y = 577.8925), i.e. on the line y = 0.96a;. The ratio of the moments of inertia 
of the point distribution along the principal axes is 29.005. 

These results are to be compared with a similar point distribution generated by a 
random set of points in the 36-dimensional 0^ space. In that numerical analysis 

gives the ratio of the moments of inertia of 4.0067. This seems to constitute a clear 
evidence for calling the above distributions of lower minima ultrametric to a large degree. 
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At the same time, we have to stress the difference between the case of e = 1 lower- 
energy configurations and the two other cases. 

As is well known from the theory of spin glasses |14| , [JOJ , one can define roughly two 
types of ensembles with ultrametric properties: 

(i) for the first type (which corresponds to the high-temperature phase in the spin-glass 
case), the triangles in the configuration space are mostly equilateral; 

(ii) the second type (which corresponds to the low-temperature phase in the spin-glass 
case) has, in addition, an isoceles-triangles component. 

For e = l, the 133 lower configurations are mostly a- helical with the third, non- 
plotted side of the triangle being usually much smaller than the other two (second type 
ultrametricity) . By contrast, for e = 10 and 80, only a small fraction of the configurations 
taken for ultrametricity analysis are helical, since, as mentioned above, our statistics 
in these two cases is rather poor. In these cases the majority of triangles have three 
comparable sides (first type ultrametricity). To check this point, we considered separately 
495 upper e = 1 configurations with no a-helices present at all. In this case, we obtained a 
plot of Cf 95 =20092215 triplets of points, with the center of gravity at (x = 507.1641, y = 
469.5677) (lying on the line y = 0.93a;), with the ratio of the moments of inertia of 21.6; 
furthermore, we found that the third non-plotted side of the triangle is comparable to 
the first two. We thus conclude that for all values of e, there is a first type ultrametricity 
among the non-a-helical configurations. 

Much more interesting is the ultrametricity among highly-ordered, a-helical config- 
urations that we see in the lower-energy e = 1 case. One may try to put forward a 
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qualitative argument why ultrametricity could be expected here. Suppose that an inde- 
pendent change in the (pip coordinates corresponding, say, to one turn of the helix can 
drive the molecule from one minimum to another one with a much smaller change in other 
(pip coordinates. Consider one particular energy minimum corresponding to an a-helix ( 
minimum (1) in Fig. 8). Now consider another minimum corresponding to a small change 
in the position of one turn of the helix (minimum (2) of Fig. 8). This minimum is very 
close in (pip space to minimum (1). Still another minimum can be obtained by a small 
change in the position of two turns of the helix (minimum (3) of Fig. 8). This minimum 
is further away from minimum (1) or (2), but there will be other minima closer to it ob- 
tained from minimum (3) by a small change in the position of only one turn of the helix 
(minimum (4) of Fig. 8). Changing three turns of the helix, we obtain a minimum still 
further away (minimum (5) of Fig. 8). In this fashion, we obtain a hierarchy of clusters 
(called level-0, level-1, level-2 etc... cluster in Fig. 8), corresponding to a change inside 
a given turn of the helix, in one more turn, in two more turns etc... Even though this 
model is necessarily simplistic, and the reality may be organized in a somewhat more 
complicated way, it provides an insight into the reasons for existence of a fine structure 
among this type of configurations. 



3.4 a- helical content: persistence of a- helical structure 



A definition of what is an a-helical portion of a secondary structure differs from one 



author to another |TJ|, ]T7|], fT5 |. In our approach, it is natural to adopt the definition 
stemming from our way of labeling configurations (Section 3.1). Namely, a configuration 
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is said to have an a-helical piece of length n if its label contains a 2^ part. We shall 
also say that such a configuration has an a-helical content of n. The reason for choosing 
this definition, rather than a definition based on H-bonds or on explicit variation of <f> and 
if) angles (like in [|18|]) is that due to the clustering phenomenon, all such definitions are 
equivalent, and ours is much easier to use. 
(1): e = 1 case. 

We analyzed the relative distribution of configurations with different a-helical content 
n. In Fig. 9 we show how many e = 1 minimized configurations contain a-helical pieces 
of given n. (Our convention is such that, e.g., a chain labeled {2 < - 2 - ) 1 5 7 3 4 8 4 2^ 3 7 
8 2 < - 3 - ) }, which contains one 2-chain of twos and two 3-chains of twos, will contribute once 
into the count for 2-chains and once into the count for 3-chains. Other conventions could 
be envisioned.) We see that the distribution has a deep at n—9. 

This means that in this strong electrostatics case of e=l, we encounter the phenomenon 
of persistence of the a-helical structure: once it has reached n=10, there is a tendency 
for the a-helical chain to become longer, but there is no such tendency for smaller n. 

These results qualitatively agree with the known existence of a critical length l c , below 
which an a-helix is not the lowest-energy state of a polyalanine. Using a potential different 
from ours, an l c of 16 was found in . 

The analysis can be pushed somewhat further. Dividing all configurations into those 
which contain a-helical pieces with n > 9 and those with n < 9, we plot two energy 
histograms, marking the number of configurations of each kind falling in a given energy 
bin. These histograms are plotted in Fig 10. We can see that there is a very small overlap 
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of the two histograms: the two kinds of configurations are quite well separated in energy. 
Another way to see this energy separation is to plot the entire energy range spanned by 
each set of configurations of a given helical content n. This plot is presented in Fig. 11. 
(2). Other values of e. 

The helical content behavior differs significantly for other values of e. The helical 
length histograms similar to that of Fig. 9 are given in Figs. 12,13. Since no change in 
the chain character ever occurred during the second minimization stage, and in order to 
increase the statistics, we took all minimized chains, rather than only the fully minimized 
ones, for this analysis. This may have resulted in a multiple count for those partially 
minimized chains which after full minimization converged to identical minima. This does 
not influence the qualitative character of the histogram. Here it is especially interesting 
to look at the helical content distribution with and without taking into account the chains 
produced by the biased MC method, which searched specifically for a-helices. Unlike 
in the e = 1 case, where complete a-helices (n=18) could easily be found by whichever 
method, here only very few (6 out of 23) are found by nonbiased methods for e = 10, 
and 6 out of 24 for e = 80. (Remember that this count is different from the count of 



Section |3T2| , where all configurations converging to identical minima were identified). The 
helical content distribution falls rapidly for e = 80, with some hints of a rise around 
n — 16 for e = 10. Nevertheless, the n — 18 full a-helices remain the deepest minima 
in all cases: it remains to find them. This means that while the position of the deepest 
minimum cluster does not change much with e, its width falls down rapidly. 

Energywise, the deepest minimum found is -113.285 kcal/mol for e = 10, with rea- 



19 



sonably helical configurations running up to -105.599 kcal/mol, and -68.519 kcal/mol for 
e = 80, with reasonably helical configurations running up to -63.525 kcal/mol. 

3.5 The role of various energy terms in stabilizing configurations 

The energy components for a sample of 11 lower-energy minimized configurations are 
given in Table VI. These configurations are completely or almost a-helical. We can see 
that the Coulombic interaction is about 0.92 of the total energy, and the van der Waals 
interaction about 0.09 of the total. The other components can almost be neglected. Even 
though the absolute value of energy in CHARMM has no physical sense and could be 
arbitrarily shifted, it is still true that the Coulombic forces give the main contribution 
into the energy differences between configurations (11.74 kcal/mol out of 19,83 kcal/mol 
between the lowest and the highest configurations of Table VI). So in this case, we could 
think that the helical shape is created mainly by electrostatics. 

The situation, however, is not that simple: exactly the same form of the helical lowest 
minimum is stabilized in case of larger e values mostly by non-electrostatic components. 

Looking at Table VII, where we give the same kind of splitting the energy into com- 
ponents for e = 10 and 80, we can see that in the e = 10 case, the energy difference of 
58.03 kcal/mol between the lowest and highest configurations of Table VII, is due mostly 
to the Lennard- Jones forces (45.08 kcal/mol). In the e = 80 case, the energy difference is 
8.36 kcal/mol, and consists mostly of Lennard- Jones (2.47 kcal/mol) and dihedrals' (4.09 
kcal/mol) contributions. The Coulombic contribution is negligeable (.4 kcal/mol). So in 
this case, the reason for obtaining the same shape of the molecule is quite different. In 
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all cases we can speak about cooperativity of various energy components in producing a 
helical shape. The Ramachandran plot for a single residue, with no dipole-dipole interac- 
tions, already gives us a setup for producing a helix; in the case of strong electrostatics, 
as the molecule becomes longer, Coulombic forces contribute to maintaining an a-helical 
shape. 
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4 CONCLUSIONS 

We studied here the energy-minimized structures of twenty (L-alanine). We confirmed 
that the lowest-energy structures are a-helices for a wide range of the dielectric constant. 
We found, however, that the role of different energy components in stabilizing these 
structures is quite different. At the same time, the value of e plays little role in the minima 
distribution in the configuration (pip space. Our main results bear upon the evidence for 
ultrametric behavior of lowest-energy, a-helical structures and, independently, of higher- 
energy, disordered structures. We give a qualitative argument for ultrametricity of lower- 
energy configurations. 

We also find, in agreement with previous studies of the critical length for polyalanine 
and other polypeptide molecules, that a-helical structure becomes favorable once it has 
reached a certain length (around ten residues). 

We would like to thank J.Smith for useful discussions and suggestions. 
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partially minimized 


fully minimized 


taken for 


energy range of 




MD 


MC 


MC e 


total 


taken for 


nonidentical 


ultrametr. 


fully minimized 












minimization 




tests 


conf. (kcal/mol) 


e = 1 


697 


86 


16 


799 


226 


133 


133 


-591.202 to -571.371 


e = 10 


150 


36 


86 


272 


84 


69 


69 


-116.285 to -53.715 


e = 80 


350 


16 


86 


452 


32 


18 


61 


-68.519 to -59.537 



Table I. Number of configurations of 20-alanine obtained during different simulations. 
All obtained configurations were at least partially minimized (2000 cycles of ABNR). Of 
these, a lower-energy fraction were fully minimized. MC e denotes those MC-generated 
configurations which were generated at a different value of e than the one used during 
minimization. In the e = 80 case, the 18 fully-minimized nonidentical configurations 
were complemented by 43 partially-minimized configurations for the ultrametricity test 
purposes. 
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structure 


energy, kcal/mol 


initial: 
minimized: 


222176511266115526 
222126511166115126 


-387.2929 
-496.8968 



Table II. An example of a structural change during minimization. The integers refer 
to the labeling introduced in Figure 2. 
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energy 
kcal / mol 


222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222221 
222222222222222221 
222222222222222222 
422222222222222222 
422222222222222222 
222222222222222222 


-591.202 
-591.099 
-590.401 
-590.135 
-589.997 
-589.791 
-589.509 
-589.437 
-589.163 
-588.895 
-588.827 
-588.748 
-588.743 
-588.690 


142222222222222221 


-586.820 


141722222222222222 


-572.014 


333333333333333333 


-551.657 


126265116675755111 


-476.187 



Table III. Minimized configurations, e = 1. The left-handed helix (3^ 18 ^) has about 40 
kcal/mol higher energy than the lowest a-helix. 
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energy 
kcal / mol 


222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
222222222222222222 
272222222222222222 
272222222222222222 


-113.285 
-112.635 
-112.380 
-111.798 
-111.748 
-111.731 
-111.555 
-111.493 
-110.843 
-110.135 
-109.234 
-109.168 
-107.925 
-106.445 


176421721722222177 


-100.896 


522228872222222235 


-95.879 


561576113297244968 


-53.715 







Table IV. Minimized configurations, e = 10. 
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1 7 

2 2 



112611151611114258 
122222221151213555 



213221218724911151 



561576113297244958 
561576113297244968 



energy 
kcal / mol 



Table V. Minimized configurations, e = 80. 
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energy 


bonds 


angles 


dihedrals 


impropers 


vdW 


Coulomb 


^Q1 9(19 

—Oct 1 .ZiUZ! 


fl 977 


A fl7fl 


^ "ifi7 




D91 


^/n 9^ 


^Q1 DQQ 


n 97^ 


^ Q98 




1 9D1 




^/H fi98 




n ^1 ^ 

u .o ±o 


A 09^ 


tJ.OUO 


1 91 7 


99^ 






fl 9Q7 




^ fi^Q 


1 97^ 


U 1 .04:1 


• J-±- J . - J X X 


p;qq qq7 




O.OOcf 




1 9^4 




^49 7Q^ 


-572.478 


0.452 


4.712 


6.755 


1.418 


-50.313 


-535.501 


-572.430 


0.445 


5.197 


6.878 


1.634 


-55.455 


-531.129 


-572.314 


0.462 


4.538 


6.528 


1.518 


-55.084 


-530.275 


-572.244 


0.465 


4.536 


6.508 


1.516 


-55.013 


-530.256 


-572.014 


0.446 


7.293 


6.650 


1.662 


-54.686 


-533.378 


-571.371 


0.447 


4.864 


7.218 


1.760 


-54.144 


-531.515 


values averaged over 133 lower minima: 


-580.967 


0.398 


4.733 


6.652 


1.440 


-55.008 


-539.182 



Table VI. Energy components for the lower-energy (mostly a-helical) e = 1 configura- 
tions. 
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C 






q in rrlpc 


n i npnT^ It; 

VX1JXCVX1 dlQ 


1JX1LJ1 UUCI a 


vHW 

V IX V V 


( ,011 lnmh 

V_y VJ LIXU1JXU 


10 


-111.748 


0.626 


5.153 


4.440 


0.132 


-70.719 


-51.379 


i n 


XXX. tjtjtj 


61 6 

W . VJ X VJ 


4 71 

t: . 1 XW 


4 1 77 


116 


-70 752 

I W. 1 *JZj 


-50 421 


1 n 

x w 


-1 1 1 493 

XXX .^Ju 


560 

W . *jvj w 


4 739 


4 278 

t: . Zj I (J 


122 

VJ . X Zj Zj 


-70 750 

1 W. 1 JU 


-50 441 

JU. 11 X 


10 


-54.822 


1.220 


7.558 


4.813 


0.155 


-26.431 


-42.137 


i n 


-^4 034 


1 949 


7 831 


5 D39 


n 1 70 


-9fi 148 

ZjVJ. 14:0 


-49 168 

t:Zj . XVJO 


1 

lu 


-53 71 5 

tJtJ. I XtJ 


1 243 

X . Zj t: - J 


7 81 5 


4 848 


1 55 

VJ. XtJtJ 


-25 639 

ZjtJ . VJO <J 


-49 1 37 


80 


-67.491 


0.635 


4.988 


3.791 


0.086 


-70.700 


-6.291 


80 


-68.519 


0.616 


5.108 


4.039 


0.095 


-72.016 


-6.361 


80 


-67.613 


0.671 


5.018 


4.025 


0.089 


-71.154 


-6.261 


80 


-60.446 


0.847 


5.386 


5.662 


0.050 


-66.622 


-5.770 


80 


-59.537 


0.997 


6.641 


8.864 


0.143 


-70.464 


-5.717 


80 


-59.132 


0.997 


5.920 


7.881 


0.113 


-68.225 


-5.817 



Table VII. Energy components for typical lower-energy (a-helical) e = 10 and 
configurations. 
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FIGURE CAPTIONS 

Figure 1: Numbering of atoms for 20-alanine. The period for the intermediate ( ) 

region is ten atoms long; one finds, e.g., nitrogen atoms at 33, 43, 53,. .,153,... 

Figure 2: Energy minima of 20-alanine on a typical (f)ip plane: (a) e = 1, (b) e = 80 

Figure 3: 0-^-plane splitting generating splitting of the 36-dimensional 0^-space into 
"boxes". 

Figure 4: Chain tracing of the lowest-energy e = 1 configuration. The hydrogen bonds 
are given as broken lines. 

Figure 5: Ultrametricity test for 67 out of 133 lower-energy minima, e = 1. The graph 
consists of 47905 points: for each triangle in configuration space formed by three energy 
minima, the length of the second longest side d 2 is plotted against that of the longest side 
d±. In the ideally ultrametric case, all points would lie on the diagonal y = x. A subset of 
only 67 minima was chosen for this graph due to the printer's memory limitations; actual 
computation was done for 383306 points generated by 133 minima. 

Figure 6: Ultrametricity test for 69 lower-energy minima, e = 10. The graph consists of 
52394 points: for each triangle in configuration space formed by three energy minima, the 
length of the second longest side d 2 is plotted against that of the longest side d±. In the 
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ideally ultrametric case, all points would lie on the diagonal y — x. 

Figure 7: Ultrametricity test for 61 lower-energy minima, e = 80. The graph consists of 
35990 points: for each triangle in configuration space formed by three energy minima, the 
length of the second longest side d 2 is plotted against that of the longest side d±. In the 
ideally ultrametric case, all points would lie on the diagonal y = x. 

Figure 8: A sketch of the hierarchy of energy minima in (pip space corresponding to a 
change in a given turn of the helix, in two turns, etc... 

Figure 9: Frequencies of encountering an a- helical portion of length n among the e = 1 
minimized configurations, (n = 18 corresponds to a complete a- helix). 

Figure 10: The energy histograms for e = 1 minimized configurations: the y-axis repre- 
sents the number of configurations falling into each 3 kcal/mol energy bin. 

dashed line: configurations with helical content n < 9. 

solid line: configurations with helical content n > 9. 

Figure 11: Energies spanned by configurations characterized by a given helical content n. 

Figure 12: Frequencies of encountering an a-helical portion of length n among the e = 10 
minimized configurations, (n = 18 corresponds to a complete a-helix). 
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Figure 13: Frequencies of encountering an ct-helical portion of length n among the e = 80 
minimized configurations, (n = 18 corresponds to a complete a-helix). 
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